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Abstract 

(N 

The busy period for a queue is cast as the area swept under the random walk until it first 
returns to zero, B. Encompassing non-i.i.d. increments, the large-deviations asymptotics 
of B is addressed, under the assumption that the increments satisfy standard conditions, 
^ including a negative drift. The main conclusions provide insight on the probability of a large 

busy period, and the manner in which this occurs: 
(i) The scaled probability of a large busy period has the asymptote, for any 6 > 0, 



H 



X 
S3 



lim log P{B > bn) = -XVb, 




^ where if = 2^/- j A{e) dO , with A* = sup{6' : h{9) < 0}, 

^ and with A denoting the scaled cumulant generating function of the increments process. 

(ii) The most likely path to a large swept area is found to be a simple rescaling of the path 
C\| on [0, 1] given by, 

> r{t) = -Hy{i-t))/x* . 

In contrast to the piecewise linear most likely path leading the random walk to hit a 
00 high level, this is strictly concave in general. While these two most likely paths have 

very different forms, their derivatives coincide at the start of their trajectories, and at 
their first return to zero. 

^-H These results partially answer an open problem of Kulick and Palmowski [16j regarding the 

7—i tail of the work done during a busy period at a single server queue. The paper concludes 

with applications of these results to the estimation of the busy period statistics {X*,K) 
^ based on observations of the increments, offering the possibility of estimating the likelihood 

of a large busy period in advance of observing one. 



1 Introduction 



Consider S = {Sk : A; > 0}, a random walk that starts at zero and has (not necessarily i.i.d.) 
increments process X = {Xn : n > 0}: 

k 

So := and Sk = ^Xi for A; > 1. (1) 

1=1 
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Figure 1: Relationship between A and ■0* 



Define the stopping time and stopped variable: 



r 



r := inf{A; > 1 : 5^ < 0} and B = > S, 



2=1 



The tail behavior of B (and related random variables) is of interest in several apparently distinct 
fields from queueing systems, to percolation, to insurance [T6 l [T8 l [T2]. 

By simple rescaling arguments, should the following limit exist, it must have this form: 

lim ^ log P{B > nh) = -Ky/b for some K >0. (3) 

This limit is established with > in [2] for the particular case of the the M/M/1 queue. By 
extending results in [T2] from a fixed terminal point to the random terminal point r via the 
infinite time-horizon sample path Large Deviation Principle setup in tl3j . Theorem [2] establishes 
the limit eq. ([s]) for a broad class of non-long-range dependent, light-tailed arrivals processes, 
providing a formula for K. The scaled Cumulant Generating Function (sCGF) associated with 
the scalar LDP is denoted 

A{e) = lim J log E{exp{9Sk)) for 9 £R. (4) 
Identifying A* = sup{0 : A(^) < 0}, Theorem [2] shows that under Assumptions [l] and [2| 

K = 2^-^ ' A{9)d0. (5) 

Its proof establishes that the most likely path to a large swept area is strictly concave. The 
most likely path that first returns to zero at t = 1 is identified to be 

V'*(t) = -^A(A*(l-t)) forte [0,1], 

with all other most likely paths being simple rescalings of ip*, as illustrated in Fig. [T] This is 
in contrast to the most likely path for the random walk to hit a high level, which is known to 
be piecewise linear, e.g. \1\ |14|. 

As an illustrative example, consider a random walk with i.i.d Gaussian increments for which 



everything is calculated in closed form in Sec. 4.1 Ten billion busy period paths were simulated 



and, of these, the one with the largest swept area as well as the one that attained the greatest 
height were recorded. In addition to plotting these paths in Fig. [2j conditioned on these values 
the most likely paths to these events are shown. The distinct shapes of the paths to these 
two unlikely events is apparent. Note that the terminal times of the theoretically predicted 
paths are deductions of the conditioned area and height, respectively, and are not constrained 
explicitly. Also shown is the logarithm of the empirical probability with which the busy period 
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Figure 2: I.i.d. Gaussian(-l/10,l) increments. Simulated results from ten billion paths. (Left 
panel) Four paths displayed. Simulated path with largest swept area 111,524.13 and, condi- 
tioned on that area, the most likely path to give rise to it as predicted by Theorem [2j which is 
concave. Also shown is the simulated path that reaches the greatest height 98.36 and, condi- 
tioned on that height, the most likely path to give rise to it as deduced from Theorem [T| which 
is piecewise linear. (Right panel) Logarithm of the empirical probability that the area under 
busy period exceeds a given level compared with theoretical prediction of —Ky/b and (as the 
asymptote does not capture the prefactor) the linear shift —KVb + k, where k is chosen to 
match the offset of the empirical observation 



exceeded a given value as compared to the large deviation estimate. Up to a constant prefactor, 
the approximation inferred from Theorem [2] is remarkably accurate. 

These results provide a partial answer to Open Problem 3.1 of |16j regarding precise asymptotics 
for the probability that B is large, by identifying the associated rough asymptotics and showing 
that most likely paths are typically strictly concave. They also reveal a lacuna in [S] Theorem 
4.1] where the most likely path is assumed to be piecewise linear. 

We conclude the paper in Sec. [5]with a discussion of the practical utility of these results. Given 
observations of the increments process X, estimates of the key quantities (A*, K) can be created 
that, under additional restrictions on X, can be shown to satisfy a large deviation principle. 
This offers the possibility of accurately estimating the likelihood of a long busy period in advance 
of observing one. 



2 Functional setup 

The framework for analysis used in this paper was first developed for weak convergence of 
probability measures [HI SJ [23], and subsequently used in the context of sample path large 
deviations, e.g. [TJ [T71 [251 [HI [13l [10] . I^i particular, Ganesh and O'Connell [Hj employed this 
setup to establish an infinite time horizon version of Anantharam's result [T], proving that the 
most likely path to exceed a high level for a random walk with negative drift is piecewise linear 
on the scale of large deviations. 

Let C[0,oo) denote the collection of real-valued continuous functions on [0, oo). Let A[0,oo) 
denote the collection of the integrals of functions that are elements of L^[0,x) for all x > (for 
example, see Riesz and Sz.-Nagy f2T]). For each r E M, define the space 

yr:=LeC[0,oo):l^^jff-=r] (6) 
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and equip it with the topology induced by the norm 



sup 

t>0 



l + t 



Define the polygonal sample paths 

1 / [nt] \ 

Suit) ■■= -S[nt] + y~~) ^H+i' * ^ [0,00). 

The sample path process {5'„(-)} is known to satisfy the LDP in for a broad class of non- 
long-range dependent, non-heavy-tailed random walks. For example, see |14| Theorem 1], where 
[5l Theorem 2] provides general mixing and uniform tail exponent conditions under which 
the prerequisites of this theorem hold. This encompasses increment processes that are Harris 
recurrent Markov chains, subject to a Foster-Lyapunov drift condition |15| . The existence of 
such an LDP will be the primary assumption in our proof of eq. Q . 



3 Tail asymptotics 

We shall make two assumptions. The first is the existence of a sample path LDP for the random 
walk that ensures the walk has negative drift, but a possibility of becoming positive on the scale 
of large deviations. 

Assumption 1 The sample path process {5'„,(-)} satisfies the LDP in y-s, some 5 > Q, with 
rate function 

= mt))dt ifci> G A[o, 00) n y^,, 

1 -|-oo otherwise, 

where I is the good, strictly convex rate function associated with the random walk {5^(1)} and 
I{x) < 00 for some x > 0. 

Our identification of the most likely path to a large swept area will have a surprising relationship 
with the most likely path to exceed a high level, so we recall the following result of Ganesh and 
O'Conneh. 

Theorem 1 ([14j) Under Assumption^ the following exist and are non-negative 

x* = arginf -I(x) and X* = —I(x*) = sup{9 : A(0) < 0}. (8) 

x>0 X X* 



Moreover, for any h > 0, 



lim -logP ( supS'fc >nh]= -hX*, (9) 

n->oo re V 



while the most likely path to this event is 



^*{t) = r' (10) 

\h + {t-h/x*){-6) ift>h/x*. 
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Note that the probabiHty of sweeping a large area, eq. ([s]), decays on a slower scale than the 
probability of hitting a high height, eq. ([9|. 

For the supremum of a random walk, A* determines the rate of decay of the probability of 
hitting a high level as shown in eq. ([9|. For busy periods, it will play a new and surprising 
role in the characterization of the most likely path to a large swept area. The inverse of V/ 
will prove central to the development that follows. To that end, our second assumption is the 
following regularity condition. 

Assumption 2 The rate function I is continuously differentiable on an interval that contains 
[-6,x*]. 

This assumption justifies the definition, 

H(r) =: (V/)~^(r), r G M, whenever the inverse exists. (11) 

The inverse exists, so that H is finite- valued, on an interval that contains [S(— 5), H(x*)] = [0, A*]. 
That A* is significant here stems from the second part of the following lemma. 

Lemma 1 The scalar A* defined in eq. ([s]) satisfies 

X* = VI{x*). 

Moreover, it is the unique positive solution of 

1 

E{X*s)ds = 0. (12) 







Proof: The identity V/(x*) = I{x*)/x* follows from the first-order optimality condition for 
X*, based on its definition in eq. ([s]). 

Clearly A* is positive as x* is positive and I{x) > for all x > —5. To see that A* thus defined is 



a solution of eq. (12), direct substitution, change of variables and integration by parts suffices. 



To see it is unique, note that it is equivalently characterized as any positive solution of 



/ H(s)ds = 0. 
Jo 



As / is strictly convex, H(a;) is strictly increasing when finite. At x = we have that H(0) 
—5 < 0, so this equation only has one positive solution. 



We will use A* to define the most likely path to sweep an area over [0, 1]: 

[(l-t)(5 fort>l. 



The most likely path to sweep any other area will be a simple rescaling of this solution. This 
path -0* is strictly concave on [0, 1] as / is strictly convex. On this interval it is of the form 



found in [12] in the analysis of simulation of queues: on differentiating each side of eq. (13), it 
follows that the path satisfies the simple differential equation, 

V/(|r(t))=A*(l-t) forte [0,1]. (14) 
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Note that, thus defined, we have that 



|V*(0)=S(A*)=:r* and |V* (1) = ^(0) 



-5. 



(15) 



That is, remarkably, the most hkely paths to sweeping a large area, ip* in eq. ( 13 ), and to exceed- 



ing a large height, ip* in eq. (10), both start and end with identical derivatives, but are distinct 



in-between. Before providing the main result, we establish the following characterizations of 
Proposition 1 The following hold for t G [0, 1].' 



(i) In contrast to eq. (13), a non-integral representation is obtained in terms of the rate 
function, 



(ii) In terms of the sCGF, 



and hence, 



X* 



r{t) = --A{x*{i-t)). 



(16) 



(17) 



Proof: The identity VA{6) = H(0) can be established for 9 £ [0, A*] based on convex duality 



(e.g, \22\ Proposition 11.3]). The representation in eq. (17) then follows by integration using 



■0*(O) = 0. The characterization in eq. (16) can be obtained from eq. (17) noting that A{9) 

eE{9)-i{E{e)) for e e [o,x*]. 



A simpler expression for A* is obtained when A is symmetric. This symmetry can be interpreted 
as asymptotic reversibility of the underlying walk. 

Proposition 2 Recall that X* > is a zero of A, —5 < is a zero of I, and x* > is the slope 
given in Theorem^ Under the symmetry condition, 

A{e) = A(A* - 9) for all G [0, A*], (18) 

then these parameters are related as follows, 

X* = 5 and X* = VI{6) = I{6)/6 = 2V/(0). (19) 



Moreover, if eq. (18) holds for all 9 £M., then 



Ux) — I(—x) 

X* = for all X such that I{x) < oo. 



X 



(20) 



Proof: If A{9) = A(A* - 9) for 9 e [0, A*], the path ?/;*(t) is symmetric with ^*{t) = ^/;*(1 - t). 
Thus ^ip*{t) = — t). In particular, evaluating this at t = and using eq. ( [I5| ) gives 



dt 



dt 



■0*(1) = S. Hence by eq. (^, A* = V/(5) = I{6)/5. Equating the derivatives 



at t = 1/2 gives 2H(AV2) = so that A* = 2V/(0). 



If eq. (18) holds for all 9, then 



I{-x) = sup {9{-x) - A(A* - 9)) = sup ((A* - 9){-x) - A{9)) = I{x) - X*x for all x. 



This gives eq. (20) for all x G M such that I{x) < oo. 
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For example, eq. (18) is satisfied if X is i.i.d., E{exp{9Xi) is finite in a neighborhood of 
the origin and P{Xi = x)/P{Xi = —x) = exp(— A*x) for all x, as holds for Xi Gaussian or 
Bernoulli-!— C, +C}. This condition, however, extends beyond i.i.d. increments processes and 
in Sec. [4] we present an example where eq. (18) is satisfied for a Markov chain X. 

Armed with these assumptions, definitions and characterizations of the path ip* , we now prove 
the main result. 

Theorem 2 Under the above assumptions, for any 6 > 0, 

lim - \ogP{B > n^b) = -KVI. 

n— >oo n 

where K is given in ([s]). Moreover, 

(i) The most likely asymptotic value ofr/n leading to {B > n^b} is 

a =: —Vb. 

(ii) The rescaled, most likely asymptotic path of Sn{-) is 

rdt) = ant/a) = -2^ A (a* - , (21) 

which is strictly concave on [0, a] . 

Proof: The method of proof is to construct a collection of open sets, {-Be}, and a closed set F 
in y^s such that for all e > sufficiently small 



and 



{Sni-) G B,} C{B> bn^} C {Sn{-) G F} 



lim lim inf - log P(S'„(-) G B^) = lim sup - log P(5„(-) G F). 

e^O n-s>oo n n-5-oo ri 



Define 



F := |(/> G 3^_5 : ^ max((/>(t), 0)dt > &| . 
UB> bn^, then 5„(-) G F as 

poo pnr -1 -1 

/ max(5n(t), 0)(it > / Sn{t)dt = — S2Si = ^B>b. 
Jo Jo ™ " 

The set F is closed as i— )• max((/), 0) is Lipschitz continuous from y^g — )• 3^o and, as for any 
(j) G 3^-5 we have (/>(t) < for all t sufficiently large, integration is also continuous (e.g. |24|, 
Theorem 11.5.1]). Thus we can use the LDP upper bound to obtain 

lim sup — log P{B > bn^) < lim sup - log P(S'„(-) G F) 

n—^oo ri n— >oo n 

= -inf{Ioo((/>) : </> G F} = - inf inf |/oo(</') : ^ max((/)(s), 0) > foj 
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As lim(_>oo (l>{t) /(I + ^) = ~^ < for all (f) G 3^_5, this infimum over t is attained at some finite 
t. Consider this inner functional infimum for fixed t: 

minimize Ioa{<P) 
subject to </> G L+[0,t] and ( max((/>(t), 0) > h. (22) 

^0 

This functional optimization problem is closely related to [121 eq. (6)], where one can identify 
b with z. Mild alterations to Proposition 7 therein shows that with h = a? ip*{t)dt for some 



a > 0, with il)*{t) defined in eq. (13), this infimum occurs at any t > a for which it transpires 
that ipl, defined in eq. (21), is the optimizer. This essentially occurs as 

■06 = arginf | [ I{^{t))dt : [ (t){t) = I i)*{t)dt 

I JQ Jo Jo 

The quantity A* in the definition of ■0* arises as a scalar Lagrange multiplier |12| . Note that 



iplia) = 0, and thus the most like 
6/ fQip*{t)dt. Using Proposition jl 



y path to sweep a rescaled area b satisfies r/re ^ a = 
we have that 

\*{t)dt = -^ [\{x*{i-t))dt = -j^ r Ai9)de 

'0 ^ Jo \^ ) Jo 

and thus the expression for a in the statement follows. 

What remains to be shown is that there is a coincident lower bound. Let ijj* be the unique 



solution of eq. (13) and define ipl using eq. (21). The path i^Kt) starts at ends at a, and is 
the optimal path that sweeps an area of ^l{t)dt = c? Jq ip*{t)dt = b. Let 

B, = Bl n Bl 

where, with e{t) = e(l + 1), 

Bl = L : inf m) - te) > o| and i?,^ = {</> : \\^ - {^b + e)|| < e} 

Both B] and B^ are open by construction and, for e sufficiently small, their intersection is 
non-empty. As defined, S'n(-) ^ Bl as ^^(O) = 0, but this is not significant as we can use an 
exponentially equivalent representation with 5„(0) := 1/n. Thus if e B^ with S'n(O) := 

1/n, then {B > 6n^}. As B^ is open for all e, we can use the LDP lower bound 

lim inf - log P{B > br?) > limlim inf - log P(S'„(-) € B,) 

> limlim inf - log P(S'„(-) e B^) 



liminf{/oo(0) : G Pe} = -loolV^b) 

e-S-O 



To evaluate K, note that 



lim ^ logP(B > bn") = -looirb) = -loJ^j£M^^Vb. (23) 



n— >oo n 



\/lorm 



Using A{9) = eE{e)-I{E{e)) for 9 G [0, A*], integration by parts and the fact that VA(A*) = 0, 
we have that 

1 2 '■^ 



/(V*(t))dt = -^ / A{e)de. 







Inserting this into eq. (|23|) in conjunction with the expression 

for/o V*(t) dt given above obtains 

the expression for K in ((5|). 
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4 Examples 

Theorem [2] provides a mechanism for calculating the most likely path to a large busy period 
B as well as the exponent K. We shall perform this calculation for illustrative examples: 
with i.i.d Gaussian increments where A*, ip* and K can all be determined in closed form; with 
Bernoullij— 1, +C} increments, which includes M/M/1 queue lengths, where explicit expressions 
of A* are not always possible, but tp* can be written in terms of it and K must always be 
calculated numerically; and, finally, for increments with Markovian dependencies where A* and 
ip* can be determined in closed form, but K must be identified numerically. 

For each of the examples ten billion paths were simulated. As well as recording the logarithm 
of the frequency with which B exceeded 6 as a function of b, the largest swept area and highest 
paths were logged for comparison with the theoretically predicted most likely paths. For com- 
parison with observations. Theorem [2] says that given we observe B = nPb, on the scale of large 
deviations the most likely time taken to generate the area is 

T^na = n2Vb/K = t* where t* =: \K/B-^'^ 

and the most likely path is then 

Si =: nSn {i/n) '(A^/„2 {i/n) = naip* {i/{na)) . 

Thus, since r* = an, for large swept area we have the approximation 

S,^T*r{iK). (24) 

This most likely path is solely parameterized from the observations by the value B. In particular, 
note that given B the time r is determined, so in the comparisons for the simulation results 
that follow, the length of the most likely path is not explicitly fit to data. 

Similarly, eq. ^ leads to the approximation 

log P{B >b)^ -KVb, for large b. (25) 

For contrast, we also record the path that reaches the highest height and use Theorem [T] for 
comparison. Given a path Sn that reaches a height H, Sn{-) reaches H/n and therefore 

c Q ^■/ ^~ *r/ ^ /^*^ iii<H/x*, 

bi = nbnii n) ^ nip [i n] = < 2d 

^' ' ' \H + {i-H/x*){-5) \ii>H/x*. ^ ' 

Again, given H, the time at which this most likely path returns to zero is completely determined. 

4.1 Gaussian increments 

Let X be i.i.d. Gaussian, with Xq having mean —5 < and variance cr^. As X is i.i.d., we 
have that 

2 

A{9) = y 02 _ 



10 




Figure 3: Bernoulli {— 1,+1} increments with P{Xi = —1) = 0.6, P{Xi = +1) = 0.4. Sim- 
ulated results from ten billion paths. (Left panel) Four paths displayed. Simulated path with 
largest swept area 15,825 and, conditioned on that area, most likely path to give rise to it 
predicted by Theorem [2| which is concave. For contrast, also shown is the simulated path that 
reaches the greatest height, 53, and, conditioned on that height, the most likely path to give 
rise to it as deduced from Theorem [l| which is piecewise linear. (Right panel) Logarithm of the 
empirical probability that area under busy period exceeds a given level compared with —K^/b 
and, the linear shift —K^-\-k, where k is chosen to match the offset of the empirical observation 



Using Theorem [2] we obtain the following explicit expressions 



A* = sup{^ : k{e) < 0} 



25 



a'- 



and = 2 




A' ^3/2 

K{e)d9 ' 



The most likely path to {i3 > 6n^} for large n can also be determined to be 




A demonstration of these results appears in Fig. [2] The highest and largest swept area paths 
are compared with the most likely conditioned on these quantities using the approximations in 



eq. (24) and eq. (26). The distinct nature of two paths to these two unlikely events is evident. 



The empirical likelihood of a large deviation is shown along with that from the approximation 
above, which gives remarkably good agreement up to a constant prefactor. 



4.2 Bernoulli { — 1,+C} increments 

Let X be a Bernoulli sequence taking values —1 and C with a = 1 — fi = P{Xq = C} < 
P{Xq = —1} = fi, so that —5 = aC — fj,, and define the load p = aC/fj,. For this process 

A{e) = log (^fie-^ + ae^^) 

and, by eq. ([s]). A* is the unique positive solution of 

ae^"'^ + iie^* -1 = 0. 
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Figure 4: Bernoulli {-1,+10} increments with P{Xi = -1) = 0.96, P{Xi = +10) = 0.04. 
Simulated results from ten billion paths. (Left panel) Four paths displayed. Simulated path 
with largest swept area 36, 971 and, conditioned on that area, most likely path to give rise to 
it predicted by Theorem [2| which is concave. For contrast, also shown is the simulated path 
that reaches the greatest height, 143, and, conditioned on that height, the most likely path to 
give rise to it as deduced from Theorem [T| which is piecewise linear. (Right panel) Logarithm 
of the empirical probability that area under busy period exceeds a given level compared with 
—K\fh and, the linear shift —K\fh + k where k is chosen to match the offset of the empirical 
observation 



Closed form expressions for A* do not exist apart from in a few special cases. If C = 1, A satisfies 
the conditions of Proposition [2| the most likely path is symmetric and this equation has the 
explicit solution A* = — log(/9), which could also have been determined by 2V/(0) or I{5)/5. 
Given the reversibility of the M/M/1 queue, the symmetry of '0* established in Proposition [2] 
is not surprising. 

If C > 2, then the corresponding reflected random walk is not reversible, and moreover the 
increments do not satisfy the symmetry assumptions of Proposition [2j For C = 2 we obtain the 
expression. 




-3/Li^ + 2/i + 1 - a 
2a 



while if C = 3 



A* = log 



7z 



where z 



^2(/i- 1) ~ 54y 729 ~ 2(^- 1) 54' 



Using eq. (17), for arbitrary C we conclude that the most likely path on [0, 1] can be written in 



terms of A* as 



log(^e-^*(i-*) + ae^^*(i-*) 



In order to calculate K in equation ([s]), we need to evaluate the integral /q A(6)d6. This 
doesn't result in a closed form for any C, but it is simple to evaluate numerically. 

In order to determine the most likely time and paths to a large busy period and a great height on 



the scale of large deviations we use the approximations eq. (24) and eq. i26h. For the reversible 



Bernoulli { — 1,+1} case corresponding to M/M/1 queue- lengths. Fig. ^ compares the highest 
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and biggest paths with those from theory. The quahty of the predictions is apparent. With a 
numerical integration of A giving K « 0.1485, the asymptotic approximation is compared with 
the empirical probability, showing great accuracy up to a constant prefactor. 

As an example of a non- reversible random walk, we consider the Bernoulli { — 1, +10} case where 
the most likely paths are now asymmetric (as seen in Fig. |4]). For this example. A* ~ 0.1439 
and K ~ 0.0978, both of which have been determined numerically. 



4.3 Markovian { — increments 

As an example beyond i.i.d. increments, assume that the increments process X forms a two-state 
Markov chain on the state space { — 1, +1} with transition matrix 

^ 1 /3 ) where < a < /3 < 1 . 

The stationary distribution is (/3/(a + (3), a/ {a + /3)) so we require a < f3 for stability. The 
sCGF A can be calculated using techniques described in [6, Section 3.1]: 



A(^) = log 



' (1 - a)e-^ + (1 - P)e^ + V4a/3 + ((1 - a)e-^ - (1 - /3)e^)2) ^ 



which satisfies the conditions of Proposition [2] and so the most likely path is symmetric. The 
rate function for the associated random walk can be calculated using methods described in [71 . 
For example, from [9] we have that 

/(x) = - (^^) log(l - « + - (^) log(l - /3 + (3/x{x)) 

where 



a/3x + v^a2^2a,2 + ^^(i + x)(l - a)(l - /3){1 - x) 
^^""^ ~ a(l-/3)(l-x) ■ 

One can check directly that 

X* = -p = 5 = and A* = I{x*)/x* = VI{x*) = 2V/(0) = sup{0 : A(6') < 0} = log ' ^ ~ " 



We have the following expression of the most likely path returning to at t = 1 , which can be 
seen directly to possess the symmetry ili*{t) = — t) for t G [0, 1], 



log 



(1 - a)*(l - + (1 - a)i-*(l - pf + ^Aap + ((1 - q)*(1 - - (1 - a)i-*(l - pfY) 



The integral A(^) dO does not evaluate in closed form, so again numerics must be used to 
determine K. For example, if a = 2/10 and p = 3/10, then 6 = 2/10, x* = 2/10, A* = log(8/7) 
and K ~ 0.0489. A simulation-based illustration of these results appears in Fig. [5} 
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Biggest area (theory) 
Biggest area (observed) 



20 X II)'' 



log P{B > b} 



Observed - 
Theory ■ 
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Figure 5: Markov {-1,+1} increments with P{Xt+i = +l\Xt = -I) = a = 2/10, P{Xt+i = 
—l\Xt = +1) = 13 = 3/10. Simulated results from ten billion paths. (Left panel) Four paths 
displayed. Simulated path with largest swept area 143, 875 and, conditioned on that area, most 
likely path to give rise to it predicted by Theorem [2| which is concave. For contrast, also shown 
is the simulated path that reaches the greatest height, 139, and, conditioned on that height, the 
most likely path to give rise to it as deduced from Theorem [l| which is piecewise linear. (Right 
panel) Logarithm of the empirical probability that area under busy period exceeds a given level 
compared with —KVb and, the linear shift —K^/h + k where k is chosen to match the offset of 
the empirical observation 



5 Conclusions and discussion on estimation 

As well as identifying the most likely path to a large swept area. Theorem [2] shows that in the 
absence of long range dependence and heavy tailed increments, in broad generality we have the 



approximation in eq. (25): 



log P{B > 6) ~ —KVh, for large 6, 

where K can be identified in terms of the sCGF A associated with the increments X. An 
approximation such as this might be of value for practical purposes, but unless X is known in 
advance we would require a methodology to estimate K from observations of the system. 

Based on thermodynamic ideas, Duffield et al. [H| investigated an estimation scheme for A based 
on observations of X. They demonstrated empirically that the scheme has desirable properties 
for a large class of of increment processes. Indeed, if X consists of i.i.d. bounded random 
variables [9l Theorem 1] or a finite state Markov chain |1H Theorem 3] , from observations of X 
one can construct consistent functional estimates, {A,i}, of A that themselves satisfy a LDP in 
the space of M- valued convex functions on M. From these, we can deduce an LDP for estimating 
K from observations of X as follows. If X is i.i.d, define 



K{0) = log[^-±e^-'^ 



If X forms a finite state Markov chain with an irreducible transition matrix on {/(I), . . . , /(M)} 
where f{i) ^ f{j) for i ^ j, then with 0/0 defined to be we define an empirical transition 
matrix with entries. 
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and let Dg denote the matrix with diagonal entries exp(^/(l)), . . . , exp(0/(M)) and all off- 
diagonal entries equal to zero. Then our estimate of A given n observations is 

Anie)=logpiPnDg), 

where where p is the spectral radius. In both cases we define the estimates 
a; = sup{e : Anie) < 0} and Kn = 2^-^ " Anie)d9. 

Regarding the A* estimates, [Sj Lemma 1] proves that the function g{c) = sup{9 : c{9) < 0} is 
continuous on the space of convex functions equipped with the topology of uniform convergence 
on compact subsets at all c such that c(0) = and there does not exist a k > so that c{6) = 
for 9 G [0,k]. Thus from the LDP for the estimators {A„}, if the support of Xf^ excludes a 
finite ball around the origin and if f{k) ^ for any k in the Markovian case, we have an 
LDP for {A*} by Puhalskii's extension of the contraction principle |2m Theorem 2.1]. As the 
convex functions of interest are real-valued, the function c i— t- /q^^^^ c{9)d9 is also continuous 
so that again Puhalskii's extension of the contraction principle applies and we have an LDP 
for {Kn}. Consistency of the A estimators ensures consistency of the A* and K estimators. 
Thus these A-estimators enable the estimation of K directly from observations of X, offering 
the possibility of estimating the probability of a system experiencing an exceedingly large busy 
period in advance of one occurring. 
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